NEGATIVE ASSOCIATION IN UNIFORM 
FORESTS AND CONNECTED GRAPHS 



G. R. Grimmett and S. N. Winkler 

Abstract. We consider three probability measures on subsets of edges of a given 
finite graph G, namely those which govern, respectively, a uniform forest, a uniform 
spanning tree, and a uniform connected subgraph. A conjecture concerning the negative 
association of two edges is reviewed for a uniform forest, and a related conjecture is 
posed for a uniform connected subgraph. The former conjecture is verified numerically 
for all graphs G having eight or fewer vertices, or having nine vertices and no more than 
eighteen edges, using a certain computer algorithm which is summarised in this paper. 
Negative association is known already to be valid for a uniform spanning tree. The 
three cases of uniform forest, uniform spanning tree, and uniform connected subgraph 
are special cases of a more general conjecture arising from the random-cluster model of 
statistical mechanics. 



1. Three random subgraphs 

Throughout this paper, G = {V, E) denotes a finite labelled graph with vertex set V 
and edge set E. An edge e with endpoints x, y is written e = {x, y). We assume that 
G has neither loops nor multiple edges. We shall consider three probability measures 
on the set of subsets of E, and shall discuss certain results and conjectures concerning 
these measures. Since each such measure is a uniform measure on a given subset of E, 
each of our conclusions and conjectures may be expressed as a purely combinatorial 
statement. 

The three measures are given as follows. Let be the set of all subsets of E which 
contain no cycle, noting that the elements of JF are exactly the subgraphs of G which 
are forests. Let C be the set of all subsets C of E such that {V, C) is connected, and 
let T = !Fr)C be the set of all spanning trees of G. We write F, C, T (respectively) for 
elements of JF, C, T chosen uniformly at random, and we call them a uniform forest, 
a uniform connected subgraph, and a uniform spanning tree, respectively. Note that 
all subgraphs of G = {V, E) considered in this paper have the full vertex set V, and 
are thus said to be 'spanning'. 

The uniform spanning tree has been studied extensively, see [3, 18] for example. 
The uniform forests of this paper are different from those of [3] in that, in [3], the 
term uniform spanning forest denotes effectively the probability measure on infinite 
subsets of a given infinite graph G which is obtained as the infinite- volume weak limit 
of the uniform spanning tree on a finite subgraph of G. 
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We remind the reader that, as summarised in Section 4, these three measures arise 
as weak hmits of the random-cluster measure on G in certain hmiting regimes of the 
parameters of this measure. 

The random subset 5 of i? is said to be edge-negatively-associated if 

P(e e 5, / e ,5) < P(e e 5)P(/ e S) for all e,f eE, ey^f, 

where P denotes the appropriate probability measure. 

There is a more general notion of negative association, as follows. Random subsets 
of E take values in the power set of E which we denote by O^; = {0, l}"^. An element 
u = {uj{e) : e G -B) G fis is identified with the set 77(0;) = {e E E : uj{e) = 1} of 
edges e with u){e) = 1. An event is a subset of Qe- For E' <Z E and an event A, we 
say that A is defined on E' if: for all a;, a;' G ri^;, if oj'{e) = a;(e) for all e G E\ then 
either a;, a;' G ^ or a;,a;' ^ A. The sample space 17s is a partially ordered set with 
order relation 

Ul> <uj' if a;(e) < a;'(e) for all e E E. 

An event A is called increasing if G A whenever u' > u and u & A. 
We call the random subset S of E negatively associated if 

¥(S eAnB)< F(S G A)F{S G B) 

for all pairs A, B of increasing events with the property that there exists E' C E 
such that A is defined in E' and B is defined on its complement E' = E\E' . For 
a general account of negative association and its inherent problems, see [20], which 
includes that part of the following conjecture concerning the uniform forest F. 

Conjecture 1.1. For all finite graphs G = {V,E), the uniform forest F and the 
uniform connected subgraph C are edge-negatively-associated. 

A stronger version of this conjecture is that F and C are negatively associated in 
the general sense described above. 

Since F and C are chosen uniformly at random. Conjecture 1.1 may be re- written 
in the form of two questions concerning subgraph counts. Let V = {l,2,...,n}, and 
let K be the set of AT = edges of the complete graph on the vertex set V. Let Q{V) 
denote the set of all graphs on V with neither loops nor multiple edges; since Q{V) is 
in one-one correspondence with the set of all subsets of K, we shall take Q{V) to be 
this power set. Thus we speak of subsets of K as being graphs on V . Let E <0 K. For 
X CE, let = M^{E) be the number of subsets E' of E with E' D X such that 
the graph (V, E') is connected. Edge- negative-association for connected subgraphs 
amounts to the inequality 

(1.2) M^^'-^^M^ < M^M^' for aU ej e E, f. 

Here and later in this context, singleton sets are denoted without their braces, and 
any empty set is suppressed. 

In the second such question, we ask if the same inequality is valid with 
redefined as the number of subsets E' containing X such that {V, E') is a forest. (See 
[15, 20].) 
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With E fixed as above, and with X,Y C E, let = M{> (E) denote the number 
of subsets E' of E of the required type such that E' ^ X and E' HY = 0. Inequahty 
(1.2) is easily seen to be equivalent to the inequality 



The corresponding statement for the uniform spanning tree is known. 
Theorem 1.4. The uniform spanning tree T is negatively associated. 

Thus was proved in [7], see also [3]. The lesser statement of edge- negative- 
association of T is a consequence of the 1847 work of Kirchhoff [16] concerning electri- 
cal networks. See Section 3 for a brief account of the proof of edge-negative association 
for the uniform spanning tree. 

The main advance reported in this note is the following. 

Theorem 1.5. If G = {V, E) has eight or fewer vertices, or has nine vertices and 
eighteen or fewer edges, then the uniform forest F has the edge-negative-association 
property. 

This we have verified by direct numerical calculation, using a combination of com- 
puter algorithms described in Section 5. That is, for every graph G = {V, E) with 
eight or fewer vertices, and for e, / G E, we have used a computer to count the 
numbers of forests appearing in (1.2), and we have thus checked that (1.2) is valid. 
This non-trivial calculation was achieved via a method summarised in Section 5 and 
expanded further in Sections 6 and 7. The central difficulty is that there is currently 
known no computationally efficient way of counting the number of forests contained 
in a given graph. Indeed, this problem is #P-complete and presumably computa- 
tionally intractable, see [1, 2, 21] and also [14]. Thus, some cleverness is needed to 
complete the computations in a reasonable time. In contrast, counting the number of 
spanning trees of a graph is equivalent to the evaluation of a certain determinant, a 
computationally easy problem, see [17]. 

We point out that there are 2^^ labelled graphs on eight vertices, and 2^^ on nine 
vertices. Our computation for graphs of size eight required about 40 minutes on a PC 
running Linux, using a Pentium 3 processor with an official clock speed of 2.8GHz, 
and with approximately 2Gb RAM. Neither speed nor memory capacity was tested 
by the case n — 8. The case n — 9 proved much more demanding in computational 
terms, and exhausted the RAM after running for about 9 hours and completing the 
calculation for subgraphs having 16 or fewer edges. After taking certain measures 
which preserve RAM at the expense of speed, we were able to handle subgraphs with 
18 edges or fewer in approximately 140 hours. 

There arc more general versions of Conjecture 1.1. First, one might ask whether 
the following holds instead of (1.2), 



(1.3) 



(1.6) 



)M^{a) for e 7^ /, < a < oo. 



where 



E 



a 



\E'\ 



E':XCE'CE 
{V,E') has property n 
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and n is either the property of either being a forest or the property of being connected. 
This reduces to (1.2) when a = 1. We shall encounter such a generalisation in the 
context of the random-cluster measures in Section 4. 

For a more general formulation, let p = {pe : e G K) be a collection of non-negative 
numbers indexed by K, and let E C K. For a subset E' C E, we write 

uiE') = n P'^- 

eeE' 

We now ask whether (1.2) holds with = M^{p) defined by 
(1-7) M^(p)= Yl 

E'-.XCE'CE 
{V,E') has property 11 

with n as above. Note that (1.2) becomes a polynomial inequality in real variables. 
We are grateful to Alan Sokal for his encouragement to formulate the problem in this 
manner. Such a formulation is natural when the problem is cast in the context of the 
Tutte polynomial, see Section 4. 

We have discussed only graphs G — (V, E) with neither loops nor multiple edges. 
No extra generality is gained in the consideration of edge-negative-association by 
introducing loops, since no forest contains loops, and in addition the inclusion or not 
of loops has no effect on whether a given subgraph is connected. The situation with 
multiple edges is more interesting, and is related to the general form of the conjecture 
of edge-negative-association discussed around (1.7). 

2. Planar duality 

The graph G = {V,E) is called planar if it may be drawn in the plane in such a way 
that edges intersect only at their endpoints. Suppose that G is planar and has been 
embedded in the plane in such a way. We write G also for the planar embedding of 
G, and we obtain the dual graph G'^ = {V'^,E'^) as follows. We place a dual vertex 
within each face of G, including the infinite face. For each e E E we place a dual 
edge e*^ = {x^, y^) joining the two dual vertices lying in the two faces of G abutting 
e; if these two faces arc the same, then x'^ = y'^ and e'^ is a loop. Thus E"^ is in 
one-one correspondence to i?, and G'^ is itself planar. Any configuration u E VLe 
{— (0, 1}^) gives rise to a dual configuration uj"^ lying in the space VL"^ — (0, 1}^ 
defined by w'^(e'^) = 1 — a;(e). As before, to each configuration uj^ corresponds the 
set 'q{uj^) = {e^ e E^ : Lij^{e^) = 1} of its 'open edges'. Note that a given planar 
graph may have more than planar embedding, and that different cmbcddings may 
have non-isomorphic dual graphs. It is easily seen that G is the dual graph of G"^. 

Theorem 2.1. A configuration lv E Q corresponds to a forest of G if and only if the 
dual configuration u>'^ corresponds to a connected subgraph of G"^ . 

Proof. If rj{u) contains a cycle 7, then the dual vertices corresponding to faces of G 
inside 7 are disconnected from those corresponding to faces outside 7. Thus the graph 
(F*^, 77(0;*^)) is disconnected. It is easy also to see the converse: if r]{u)) contains no 
cycle, then {V^,ri{uj^)) is connected. □ 
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Let X,Y C E. We deduce from Theorem 2.1 and its proof that the number 

of forests of the planar graph G containing every edge in X but no edge of Y is 
equal to the number of connected subgraphs of G"^ containing every edge of — 
{/^ : / G Y} but no edge in — {/^ : / G X}. It follows by the equivalence of 
(1.2) and (1.3) that, for any planar graph G, the uniform spanning forest F is edge- 
negatively-associated if and only if the uniform connected subgraph C"^ of G'^ has 
this property. In particular, Theorem 1.5 implies the edge- negative-association of the 
uniform connected subgraph of any planar graph with eight or fewer faces (including 
the infinite face) whose dual has no multiple edges. 

3. Edge-negative-association of uniform spanning tree 

Since the emphasis of this note is upon the property of edge-negative-association, we 
include a very short account of its proof for the uniform spanning tree. Kirchhoff [16] 
showed how to relate the current which flows along the edge e in an electrical network 
to certain counts of spanning trees. Consider an electrical network on the connected 
graph G in which each edge corresponds to a unit resistor. The relevant fact from 
the theory of electrical networks is that, if a unit current flows from a source vertex 
s to a sink vertex t, then the current which flows along the edge e = {x, y) in the 
direction xy equals N{s,x,y,t)/N, where N is the number of spanning trees of G 
and N{s, u, v, t) is the number of spanning trees whose unique path from s to t passes 
along the edge (w, v) in the direction uv. 

Let e = {x,y). By the above, P(e G T) equals the current flowing along e when 
a unit current flows through G from source x to sink y. By Ohm's Law, this equals 
the potential difference between x and y, which in turn equals the effective resistance 
Rg{xi y) of the network between x and y. 

Let f e f ^ e, and denote by G.f the graph obtained from G by contracting 
the edge /. There is a one-one correspondence between spanning trees of G.f and 
spanning trees of G containing /. Therefore, P(e E T \ f E T) equals the effective 
resistance RG.f{x,y) of the network G.f between x and y. The so-called Rayleigh 
principle states that effective resistance is a non-decreasing function of the individual 
edge-resistances, and it follows as required that RG.f{x,y) < RG{x,y). 

We note that the usual proof of the Rayleigh principle makes use of the Thom- 
son/Dirichlet variational principle, which in turn asserts that, amongst all unit flows 
from source to sink, the true current flow of size one is that which minimises the 
dissipated energy. 

A good account of the Kirchhoff theorem, above, may be found in [4]. Further 
accounts of the mathematics of electrical networks include [6] and [18], the latter 
containing also much material about the uniform spanning tree. 

4. The random-cluster measure 

Let = {0,1}^ be the set of 0/1-vectors indexed by E, and let uj G O^. An 
edge e is termed open in u if a;(e) = 1, and is termed closed otherwise. We write 
r]{u>) — {e E E : u;(e) = 1} for the set of open edges of u>, and k{u;) for the number of 
components of the open graph (F, 77(0;)). Amongst the various probability measures 
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on flEi probably the most studied is product measure /ip with some given density p. 
The subsequent theory is usuaUy referred to either as 'percolation' when the graph 
is part of a lattice, or as 'random graphs' when G is complete; see [5, 9, 13]. An 
important extension of product measure is the so-called random-cluster measure given 
as follows. Let < p < 1 and q > 0. We define the probability measure 0p g on fi^ 

by 

where the 'partition function' Z(p, q) is given by 

Zip. 9) = E I n p^^''^ (1 - pv-^^Aq"^^^- 

This measure was introduced around 1970 by Fortuin and Kasteleyn, and has proved 
very useful in the study of Ising and Potts models. When q = 1, (f)p,q is simply product 
measure with density p. When q G {2, 3, . . . }, (pp^q corresponds in a certain way to a 
Potts model on G with q states available at each vertex. It is a very useful property 
that (f)p^q is positively associated when q > 1, and this property does not hold when 
q < 1 (except for graphs with no cycles). Only little is known about the measure (f)p^q 
when q < 1, although a few facts can be proved. See [10] for a history and a review 
of the measure, and [11] for a more extensive account with proofs. 

We show next as in [10] that the uniform forest, uniform connected graph and 
uniform spanning tree are obtained as weak limits of (f)p^q as p, g | 0. This was known 
to Fortuin and Kasteleyn for spanning trees, see [8, 12], and to Pemantle [20] and 
perhaps others for forests. 

Suppose that < p < 1, and consider the weak limit of (j)p^q as g | 0. Because of 
the term the definition of all the mass is concentrated in the limit on 

those configurations u for which k{uj) is a minimum, that is, with k{uj) = 1. Since 
the 'weight' of a configuration uj is proportional to {p/{l —p)}\'^^^^\q''(^\ the limiting 
probability measure is given, with /? = p/{l — p), by 

Zc,f3 

where is the set of all a; such that {V,ri{u>)) e C, the set of connected subgraphs, 
and 

Thus fj,c,j3 is concentrated on the set of connected subgraphs of G, and iJ,c,i is uniform 
measure on C. 

Suppose secondly that p = pq is related to q in such a way that p and q/p ^ 
as g — > 0. The partition function Z{p, q) may be expressed in the form 
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Note that p/ {1 — p) ^0 and q{l — p) /p ^ as q ^ 0. Now k{uj) > 1 and | + 
k{uj) > \V\ for all a; G O; these two inequalities are satisfied simultaneously with 
equality if and only if r]{uj) is a spanning tree of G. It follows that, in the limit as 
g — > 0, the 'mass' is concentrated on such configurations, and it is easily seen that 
the limit mass is uniformly distributed. That is, lim^j^o (t^p,q is a probability measure 
which selects, uniformly at random, a spanning tree of G. 

Next we suppose p = q and consider the limit as g | 0. The important component 
in the weight of oo is gl''(^)i+'^('^). Now + k{uj) > \V\, with equality if and only 

if the set of open edges contains no cycle. It follows that cpp^q converges weakly as 
g I to the probability measure 

where Ojf is the set of all u such that 77(0;) G JF, the set of forests in G. Thus fx^p^i 
is uniform measure on JF. More generally, take p = aq where a G (0, 00) is constant, 
and take the limit as g | 0. The limiting measure is now 

where 

It may be conjectured that (f)p^q is negatively associated in the sense of [7] when 
q < I, and in particular that (/)p^g is edge- negatively-associated in that 

4>p,q{e is open, / is open) < 4>p^q{e is open)(^p^g(/ is open), 

for distinct edges e, /. A verification of this conjecture would be valuable in studying 
the random-cluster measure with q < 1. 

The partition function is an evaluation of the Tutte polynomial of the graph G, and 
for this reason and others there is a strong relationship between the random-cluster 
model and such polynomials. We do not describe this here, but refer the reader to 
the references in [10, 11]. 

5. The algorithm for counting forests 

We summarise next our approach to the computations necessary for Theorem 1.3. 
The computer code and some additional documentation is available at the web page 
http : //www . statslab . cam, ac .uk/ ^grg/usf / . 

To verify (1.3) for all edge sets E on n vertices is a numerical problem of con- 
siderable complexity. We give a brief summary of such an algorithm which, by dint 
of careful book-keeping, confirms (1.3) whenever 71 < 8. Our general approach is 
sketched in this section, and elaborated in Sections 6 and 7. 
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5.1 Overview 

Let V = {1, 2, . . . ,n} where n > 3, and let K be the set of N = (2) edges of the 
complete graph on V, as before. The edge e of with endpoints x and y such that 
X < y is written e = {x,y). There is a total (lexicographic) order on K given by 
{u, v) < {x, y) if either u<xoru — x^v<y. For X, F C we write Ty {E) for the 
set of all forests F (C E) such that X C F and F n F = 0, and we refer to X and Y 
as the 'constraints' of such forests. We write MyiE) = (F)|. If either X or y is 
empty, we omit it from the notation. We shall sometimes write A + B for the union 
of two sets A, B, and A — B for A \ B, and we shall generally omit braces around 
singleton sets when no ambiguity is thus introduced. 
It is required to prove (1.3), namely, 

(5.1) M^'^'fHE)M^^j^{E) < M}{E)Mi{E) 

for all e, f E E, e ^ f, and for all E C K. That is, we seek to validate (5.1) for all 
suitable triples [E, e, /), and we refer to such a triple as a 'conjecture instance'. Note 
that conjecture instances may be classified into two categories, depending on whether 
or not e and / share a vertex. With this in mind, we set Ei — {ei, /i} = {(1,2), (1,3)} 
and E2 = {e2, /2} = {(1, 2), (3, 4)}, each such set being representative of one of the 
two categories. 

The general approach is as follows. For each s — 2,3, . . . , N in order, we generate 
the family Sg of all subsets of K with cardinality s and containing either Ei or E2. 
For each s, i, and each E E £s satisfying Ei C E, we check (5.1) for the conjecture 
instance {E, e^, fi). To do so requires computations of the four numbers My^~ [E] 
for y = 0, [ei], [fi], Ei. We expand on such computations in Subsection 5.2. 

This method, if implemented naively, would be computationally infeasible in prac- 
tice, since the size of the problem grows very quickly with n, and since no efficient 
method is known for calculating quantities of the form MyiE). We propose two 
substantial economies of scale. The first aims to reduce the number of conjecture 
instances to be studied, (a) by eliminating instances which are 'isomorphic' to an 
instance already resolved, and (b) by developing a criterion for deciding when two 
conjecture instances are effectively equivalent, in the sense that (5.1) holds for the 
second whenever it holds for the first. This is discussed in much more detail in Section 
7. 

Our second economy pertains to the calculation of the terms My (E). As described 
in Subsection 5.4, such a calculation involves many sub-calculations, each of which 
may have been encountered earlier in the same or some related form. We aim to keep a 
record of certain earlier calculations, the better to avoid duplication of computational 
effort. 

Each of these measures hinges on the definition of a suitable 'index function', by 
use of which one may recognise when a proposed calculation is unnecessary; this is 
sketched in Subsection 5.3. Our index function is presented in Section 6, and is based 
on the nauty package developed by Brendan McKay in the context of the graph- 
isomorphism problem. 
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5.2 Evaluating M^{E) 

We refer to the triple {E^ X, Y) as a 'counting problem', and we shall need to compute 
My{E) for general counting problems. Note first that 



whence it suffices to compute certain M^{D). A brute- force strategy for calculating 
My{E) is to enumerate all valid forests F of the same size before proceeding to 
the next layer in the hierarchy of graphs sandwiched between X and E — Y. Let 
J^g = {F e !F-^{E — Y) : \F\ = s} denote the layer of valid forests of size s for 
s = \X\,...,\E-Y\. The partition the set {E - Y). 

The construction of the layers is described inductively, noting that J^\x\ = 
{X}. Having found JF^, we construct J-'s+i by considering every F G JF^ in turn. 
The forest F can be thought of as a (vertex- and edge-) disjoint union of connected 
components Cj. We seek to enlarge F by one edge without forming a cycle. Every 
such augmentation arises by merging two distinct components (whereas adding an 
edge to any given component must form a cycle). Thus, from every pair Ci, Cj of 
distinct components, we obtain new elements F' of J-s+i by adding an edge joining 
Ci to Cj. 

We have now that 



5.3 Indexing the counting problems 

The ensuing algorithm is very time-consuming when X is small and E — Y is large, 
since the layers J-g grow fast in n and s. It is therefore necessary to reduce to a 
minimum the number of its calls. We do this by devising some 'index function' i that 
indexes classes of counting problems {E,X,Y) having a common value of My^E). 
Let this value be for all problems with index satisfying i{E,X,Y) = l. Thus, 
for each i, one computation of suffices; the obtained value may be buffered in a 
database and retrieved to give My for all problems (F, X, Y) with the same index l. 

There are many possible choices of such a function i, and we seek one whose range 
is small and whose values are easy to compute. We show in the next section how 
such a function may be defined with the following key properties. Let (F, X, Y) and 
(F', X\ Y') be two counting problems such that z(F, X, Y) = i{E', X', Y'). Then, in 
ways to be made more precise in Section 6, 

(Kl) X and X' as well as E-(X + Y) and E' - {X' + Y') have similar structure, 
respectively, and 

(K2) X and F — {X + Y) are composed in a way similar to X' and E' — {X' + Y'). 
Properties (K1)~(K2) imply as required that M^(F - F) = M^' {E' - Y'). 



(5.2) 



M^(F) = M^(F-y), 



\E-Y\ 



(5.3) 
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5.4 Recursive formula for MyiD) 

A large number of explicit computations of (D) for X C D C E remain inevitable. 
The cost of computing M^{D) can be large when X is small, and grows very fast 
in the size of D. It is therefore preferable to expand this quantity into a sum over 
certain {D') for larger sets X' of constraints and smaller edge sets D'. These 
may be faster to compute, or may even be available from earlier computations. By 
partitioning the set of forests of D according to whether or not they contain a given 
edge d E D — X , we see that 

(5.4) M^{D) = M^{D-d)+M^+'^{D) for deD\X. 

By assumption, X C D, and we write D — X = {di, (i2, • • ■ , dk}- Let Xj = X + 
{di, d2, ■ ■ ■ 1 dj-i} and Dj = D — dj. By iteration of (5.4), 

k 

(5.5) M^{D) = J2M^'iDj) + M^{D), 

a formula which substantially aids numerical economy. 
6. Canonical labelling 

We sketch next how to label counting problems in order to build up a database 
of problems considered so far. We make substantial use of concepts and methods 
introduced by Brendan McKay in [19] and implemented in his nauty package to be 
found at |http : //cs . anu . edu . au/ ^bdm/ nauty/| , and we begin with some terms. 

6.1 Partitions 

A partition tx = (Fi, V2, . . . , Vjt) of the set F is a sequence of disjoint non-empty 
subsets of V whose union is V . The set of all partitions of V will be denoted by 
n(l/). If V is the vertex set of a graph G = {V,E), and tt G n(y), then the pair 
(G, tt) is termed a partitioned graph. The components Vi of a partition n are called 
its cells. Note that the order of the cells is significant, but the order of the vertices 
within each cell is not. A singleton cell is a cell Vi = {v} with a single element v, 
which is said to be fixed by n. If every cell of n is trivial, then tt is called discrete. If tt 
contains only one cell Vi = V, then tt is called the unit partition. A partition tt with 
k cells may be identified with a vertex-colouring c : V {1,2, by assigning 
colour i to all vertices in the ith cell Vi. In this alternative language, cells are also 
called colour classes. 

6.2 Permutations 

Let 7 be a permutation of V. The image of v & V under 7 will be denoted v"^ . 
Similarly, for U CV.we write = {u^ : u E U}. If tt = (Fi, ^2, • • • , Vk) E U{V), we 
set n'^ = {Vi , V2 , . . . , V^). If G = {V, E) is a graph, then G"' denotes the graph in 
which vertices and w'^ are adjacent if and only if v and w are adjacent in G. Since 
a permutation 7 acts on unordered pairs {w, v} of vertices, it acts also on edges. We 
now see that = {V^E-^). 
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Two partitions 7ri,7r2 G n(y) are called compatible if there exists a permutation 
7 of y such that = tt^. Thus tti and are compatible if and only if the vectors 
of cardinalities of their cells are the same. Two graphs Gi, 6*2 ^ G{V) are said to be 
isomorphic, written Gi ~ G2, if there exists a permutation 7 of F such that G2 — Gj. 
If this holds, we call 7 an isomorphism of Gi to G2- If the graphs Gi carry partitions 
TTi, we call the isomorphism 7 partition-preserving if, in addition, 712 ~ ttJ. 

The automorphism group Aut(G, re) of a partitioned graph (G, tt) is the set of all 
permutations 7 such that G'^ = G and tt"^ = tt. Since the order of cells in partitions 
is significant, the latter condition requires that 7 fixes the cells in tt setwise. If n is 
the unit partition, Aut(G, tt) is the usual automorphism group Aut(G) of G. 

6.3 Canonical labelling 

A canonical labelling map is a function C : Q{V) x 11(1/) G{V) such that, for any 
graph G — {V, E), any partition tt of V, and any permutation 7 of V, we have 

(CI) C(G',7r)~G', 

(C2) C{G'^,7t'^) ^C{G,7t), and 

(C3) if C{G, 71^) = C{G, tt), there exists S G Aut(G) such that tt'^ = tt-^. 
It was established in [19] that a canonical labelling map exists, and such a map may 
be found which is generally easy to compute. Henceforth, we assume that C is such 
a map. If tt = (V) is the unit partition, we may write C{G) for C(G, (V)). For 
G = (y, E) and a partition tt, we sometimes write C{E, tt) for C{G, tt). 

The main use of a canonical labelling is to solve certain graph isomorphism prob- 
lems, as illustrated by the following theorem which we shall use later. 

Theorem 6.1 [19, Thm 2.2]. Let Gi,G'2 G g{V) be graphs, 7ri,7r2 G U{V) compat- 
ible partitions, 7 a permutation of V , and let C be a canonical labelling map. Then 
C{Gi,7Ti) — C{G2,iT2) if and only if there exists a partition-preserving isomorphism 
from Gi to G2- 

6.4 Sufficient condition for M-^(D) = M^' {D') 

We shall next define the index function i referred to in Section 5. Let D,D' C K 
and let X C D, X' C D' . We require a rather rigid criterion on the composition of 
the pairs X,D -X and X', D' - X' in order to guarantee that M^{D) = M^'{D'). 
The condition that we shall present amounts to requiring that the pairs X, X' and 
D — X, D' — X' are each isomorphic, and in addition that X, D — X and X' , D' — X' 
'fit together' in the same manner. It will be convenient to achieve this by use of the 
notion of a partition, and of McKay's canonical labelling map C. 

Let C{D) denote the canonically-labelled isomorph of D (with unit partition), so 
that C{D) = for some permutation 5 of V; we fix such S = S{D) henceforth. Given 
two edge sets A,B <Z K, we define their interface I{A, B) to be the set of vertices v 
such that there exist a G ^, 6 G -B with both a and h incident with v. Elements of 
T{A, B) are called interface vertices. Thus T{X, D — X) represents the interface of 
the complementary subgraphs X and D — X of D. 

We wish to 'fix' this interface whilst allowing portions supported by vertices 'inside' 
X and D — X to vary up to isomorphism. For this purpose, we colour each vertex 
V G X{X,D — X) with its label in the canonical isomorph of D, namely v^. This 



12 



G. R. GRIMMETT AND S. N. WINKLER 



induces a colouring c : y — > {0, 1, . . . , n} of the vertex set of D, each vertex v taking 
colour if V E D — X) and (black) otherwise. 

The colouring c gives rise to a partition of V as follows. For < z < n, we set 
Vi — {v E V : c{v) — i}- If i ^ 0, then Vi is either empty or a singleton cell. 
Dropping all empty cells and relabelling the others while retaining the original order, 
we obtain a partition denoted tt = 7r{X, D — X,S) of the form (Vq, {ci}, . . . , {ck}), 
where k = \I{X,D-X)\. 

The required index function i is given in the next theorem. 

Theorem 6.2. Let {E, X, Y) be a counting problem, let D = E—Y and tt = 7r(X, D — 
X,5) where 5 — 5{D). Define the index function 

iiE, X, Y) = {\I{X, D - X)|, C(X, tt), C{D - X, tt)). 

// i{E, X, Y) = i{E', X', Y') then {E) = M^,' {E') . 

Since My (i?) = M-^{E — F), see (5.2), we shall have recourse to indices of the 
form i{E — Y, X, 0) which we write as j{E — Y,X). Thus 

(6.3) j{E',X)=i{E',X,0). 

Proof. Assume that i{E,X,Y) = i{E',X',Y'). Since \I{X,D - X)\ = \I{X',D' - 
X')\, the partitions tt and n' are compatible, and thus we may apply Theorem 6.1 to 
the pairs (X, tt), (X',7r'), and (L> — X, tt), (£>' — X', tt'). We deduce from the equality 
of i{E, X, Y) and i{E', X', Y'), and Theorem 6.1, that there exist partition-preserving 
permutations p : {X, tt) ^ {X', tt') and a : {D - X,7r) ^ {D' - X', tt'). Thus 

X' = XP, H' = TT^, 

^^■^^ D'-X' = {D-XY, tt'^tt". 

Since tt' = tt^ = tt'^, and since each point in T{X\ D — X') is a singleton cell of tt', 
we have that the permutations p and a agree on the interface I{X, D — X). 

For F C K, we write V{F) for the set of endpoints of edges in F. Thus, V{F) = 
V — V{F) is the set of isolated vertices of the graph {V, F). We define rj : V ^ V hy 

{vP = v'' iiv eX{X,D - X), 
vP iiv eV{X)-X{X,D-X), 

v'^ otherwise, 

and we claim that r] is an isomorphism from (V, D) to {V, D'). We check three facts 
in order to verify this. 

(i) We claim that 77 is one-one. Suppose on the converse that x ^ y and x"^ = 
y'^. Since p and a are permutations, we may label x, y in such a way that 
X e V{X) - I{X, D-X) andy eV - V{X). Since x E V{X), we ha ve that 
E V{X'). Since y eV - V{X), we have that E V{D' - X') + VW). 
Since x"^ — y"^, we deduce that x^ — y"^ E X{X',D' — X'). Now p, a are 
partition-preserving, whence x = y E I{X, D — X), a contradiction. 
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(ii) We claim that rj maps edges to edges. Let {x,y) e D. Either {x,y) e X, in 
which case {x,y)'^ — (x^^yP) G X', or E D — X, in which case {x,y)'^ = 
{x"", y"") e D' - X'. In either case, (a;'', y"^) e D' . 

(iii) We claim that rj maps non-edges to non-edges. Assume that {x^,y^) G D' . If 
{x^^y'^) G X', then {x'^,y'^) = (x^.y^) whence {x,y) E X. A similar argument 
holds with X, p replaced hy D — X, a. 

In summary, ?7 is a permutation of V such that e G D if and only is E D' . Therefore, 
r] is an isomorphism. 

It is easily deduced that rj induces a one-one correspondence between forests of 
D containing X and forests of D' containing X', whence (D) — {D') as 
required. □ 

We make a peripheral remark regarding efficiency. In the above construction of 
the partition tt = 7r{X, D — X,S), it is not relevant for the validity of Theorem 6.2 
to colour each vertex v G D — X) precisely with its label in the canonical 

isomorph d = S{D) of D. In principle, we could have chosen arbitrary distinct colours, 
thus removing the dependence of tt on 5. 

However, these colours determine the order in which interface vertices appear as 
singleton cells in the induced partition tt. This order is in turn significant when 
applying the canonical labelling map C. By consistently choosing the colour we 
maintain the same order across the whole class of isomorphs of D. As a consequence, 
i sub-divides the space of counting problems {E, X, Y) into larger classes, each with 
a common value i{E, X, Y). 

One drawback of the fact that the partition tt depends on S{D) is that we have to 
recompute each of the three components of i whenever a new graph D is considered 
and the corresponding S{D) affects tt, even when X or D — X have not themselves 
changed. Whether or not the net effect is an improved performance will therefore 
depend essentially on the speed of evaluation of the canonical labelling map C. 



7. Redundancy in conjecture instances 

We are required to check (5.1) for all conjecture instances {E,e,f). Many such in- 
stances are isomorphic to one another, and thus it becomes necessary to devise a 
criterion for deciding, given two instances one of which has already been resolved, 
whether (5.1) holds automatically for the second. 

We call the conjecture instances (£", e, /) and (£", e', /') equivalent., and write 
{E,e,f) ~ (£", e',/'), if the left and right hand sides of (5.1) are invariant under 
interchange of both sets of parameters, that is, if 

M^'^^^\E)M^^j^{E) = M^^''f'\E')M^^,j,^{E'), and 
Mf{E)Mi{E) = Mf,{E')M^,'{E'). 

We propose two measures to reduce complexity: (a) to make a selection of con- 
jecture instances which ab initio avoids many equivalent instances, and (b) to skip 
instances identified as equivalent to earlier ones, using the index function i of Section 
6. 
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7.1 Efficient selection 

We begin by fixing e and / in one of the two categories of conjecture instances. Thus 
we take Ei = {ei,/i} = {(1, 2), (1, 3)} and E2 = {es, /s} = {(1, 2), (3, 4)}. It clearly 
suffices to verify (5.1) for all E containing Ei, respectively i?2- 

Let 8s ^ {E (Z K : \E\ ^ s, Ei C E for some i}, 2 < s < N. We shaU not 
work with every member of £s for each s, since the size of Eg grows exponentially 
with n and s, and in addition many of its members give rise to equivalent conjecture 
instances. Instead we shall work with a subset Si. C defined as follows. 

For E C let Vmin = ^min(-E') bc the vertex having the least label which is 
isolated in E. If no vertex is isolated, we set fmin = n. We say that E has the 
property Hmin if each of the vertices Vmin, I'min + l,...,n— l,nis isolated in E. 

We set £2 = S2 = {Ei,E2}, and we make £2 an ordered set by decreeing that 
El < E2. Note that every member of £2 has property Umin- Having found the (totally) 
ordered set £g C £g for some given s, we construct the ordered set Egj^i C £s+i in the 
following iterative manner. Assume that every member of £g has property Ilmin. Let 
E e £g, and let Fe be the subset of K given by 



(7.2) 



:l<i<j< fmin} + {(t'minjt'min + 1)} 



-E. 



We obtain £'g^i by adding each such / e Fe to each such E, that is, 



(7.3) £Ui= U U {/ + ^>- 

Ee£', feFE 

Note that every member of £g^i has property Hmin- Furthermore, £s^i is ordered as 
follows. For E', E" E E'^ and /' G Te', f" e Te", we have f + E' < f" + E" if: 
either E' < E" in £'g, or E' = E" and /' < 

It may be seen by an iterative argument involving the quantity s that, for every 
E e £s, there exists E' e £'g such that: there exists a graph isomorphism from E to E' 
which fixes the edges and fi. It is thus sufficient to consider conjecture instances 
{E,ei,fi), (£',625/2) for E belonging to some £i,. Any other conjecture instance 
is equivalent to some such case. This process of selection reduces the number of 
conjecture instances, and is effective for small s. When s is large, we require the 
following further economy. 

7.2 Skipping equivalent instances 

Having constructed the £g, we must verify all conjecture instances {E,ei,fi) with 
E e £g and Ei C E. However, the cardinalities of the £g grow prohibitively fast 
despite their selectivity, of the order (^), and in any computer implementation this 
will tend to exhaust the available resources of memory. In order to address this critical 
issue, we require a criterion to identify equivalent instances cheaply. This will enable 
us to further reduce £g by deleting graphs E for which both instances {E, ci, /i) and 
{E, 62, /2) are equivalent to earlier ones. We use the following theorem. 



Theorem 7.4. Let E,E' <Z K and let e,feE, e', f e E' he such that e ^ f , 
e' ^ f. Ifj{E, {e, /}) = j{E', {e', /'}) then: 
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(a) (E, e, /) ~ {E', e', f), and furthermore 

(b) for every HD E there exists H' D E' such that (H, e, /) ~ (H', e', f). 

Proof. It suffices to prove part (b), since this contains (a). Let H D E, and recall 
the definition (7.1) of equivalence. Let p, a, and rj be as in the proof of Theorem 
6.2. We write X = {e, /}, X' = {e', /'}. In the present context, X' — Xp reads 
{e',/'} = {e,/}^, so that either e'' = e', = /', or e'' = /'^ = e' . We set 
H' = H^. It is now clear that i] provides one-one correspondences between the pairs 
J^^{H), T^'{H') and Tx{H), J^x'iH'), whence 

|^^(iy)| = |.F^'(iy')| and \J^AH)\ = \^x'iH')\. 

Furthermore, if = e' , — f (respectively, e"^ = /', = e'), then i] provides 
one-one correspondences between the pairs J-'^{H), J^j,{H') and J^[{H), T^^, {H') 

(respectively, the pairs !F^{H), !F^, [H') and !Fl{H), J^^',{H')). In either case, we 
have that 

\j^fm . \Tf{H)\ = \Ti,{H')\ . \Ti:{H')\. 

The condition of (7.1) follows. □ 

This result is used as follows. Whenever we do the computations required for 
(5.1) for a given conjecture instance {E,e,f), we place the index j{E,{e, f}) in a 
certain database, which we term the index database. If later we encounter a con- 
jecture instance (£", e', /') for which j{E', {e', /'}) lies already in this database, say 
j{E', {e', /'}) — j{E, {e, /}), we may deduce from Theorem 7.4 that (5.1) is equally 
valid for {E',e',f') as for {E,e,f). Thus we need not consider the new instance 
(£", e',/'). Indeed, by Theorem 7.4(b), we need consider no instance of the form 
{H\e'J') for H' D E' . 

This observation may be used recursively in order to reduce the ordered sets E'^ 
still further. Rather than working with the 8'^ given in (7.3), one works with ordered 
subsets 8'^ C 8'g, constructed as follows. We set 82 — 82. Suppose that we have found 
82, 8^, . . . , 8'J, and we have checked that (5.1) holds for all instances {E, e^, fi) with 
E G 8", t < s, and all i such that Ei = {ei, fi} C E. We cycle through the sets 
E E 8'J . For each such E, we cycle through the edges / belonging to the ordered set 
Fe of edges defined at (7.2). If both the indices j{f + E, E^), i — 1,2, lie already in 
the index database, we do not add the set f + E to 8'J^i. If either of the indices do 
not belong to the database, then we add f + E to 8'J_^i, and we check any conjecture 
instance (/ + E, Si, fi) with j{f + E, Ei) not in the index database. Thus we may 
write 

c+i= U U {/+^>' 

where F'^ is the set of all / G Fe for which one or both of the above indices do not 
currently lie in the index database. 
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